Mobilizing Text As Data
Authors
Abstract
Textual analysis methods have become increasingly popular and powerful tools for researchers in finance and accounting to extract meaningful information from unstructured text data. This paper surveys the recent applications of these methods in various domains, such as corporate disclosures, earnings calls, investor relations, and social media. It also discusses the advantages and challenges of different textual analysis methods, such as keyword lists, pattern-based sequence classification, word embedding, and other large language models. We provide guidance on how to choose appropriate methods, validate text-based measures, and report evidence effectively. We conclude by suggesting some promising directions for future research using...
Related resources
Managing Text as Data
With all their advances, database management systems of the present generation are designed to handle only data of primitive types, namely, numbers and character strings. Several approaches to extending their capabilities to handle data with higher order semantics exist. One is to add general abstract data type support, so that users can define such data types easily. In this approach, the DB...
Text as Data
An ever increasing share of human interaction, communication, and culture is recorded as digital text. We provide an introduction to the use of text as an input to economic research. We discuss the features that make text different from other forms of data, offer a practical overview of relevant statistical methods, and survey a variety of applications.
Mining Text Data
Clustering is a widely studied data mining problem in the text domains. The problem finds numerous applications in customer segmentation, classification, collaborative filtering, visualization, document organization, and indexing. In this chapter, we will provide a detailed survey of the problem of text clustering. We will study the key challenges of the clustering problem, as it applies to the...
The Zero-Delay Data Warehouse: Mobilizing Heterogeneous Databases
"Now is the time... for the real-time enterprise": In spite of this assertion from Gartner Group, the heterogeneity of today's IT environments and the increasing demands from mobile users are major obstacles for the creation of this vision. Yet its technical foundation is available: software architectures based on innovative middleware components that offer a level of abstraction superior to con...
Sampling the Web as Training Data for Text Classification
Data acquisition is a major concern in text classification. The excessive human efforts required by conventional methods to build up quality training collection might not always be available to research workers. In this paper, the authors look into possibilities to automatically collect training data by sampling the Web with a set of given class names. The basic idea is to populate appropriate ...
Journal
Journal title: European Accounting Review
Year: 2023
ISSN: 1468-4497, 0963-8180
DOI: https://doi.org/10.1080/09638180.2023.2218423